Di-codon usage for classification of genes

نویسندگان

  • Minh Ngoc Nguyen
  • Jianmin Ma
  • Gary B. Fogel
  • Jagath C. Rajapakse
چکیده

Genes are often classified into biologically related groups so that inferences on their functions can be made. This paper demonstrates that the di-codon usage is a useful feature for gene classification and gives better classification accuracy than the codon usage. Our experiments with different classifiers show that support vector machines performs better than other classifiers in classifying genes by using di-codon usage as features. The method is illustrated on 1841 HLA sequences which are classified into two major classes, HLA-I and HLA-II, and further classified into the subclasses of major classes. By using both codon and di-codon features, we show near perfect accuracies in the classification of HLA molecules into major classes and their sub-classes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Bioinformatics Comparison of Codon Usage of Genes Encoding Phosphate Transporter in Terms of Salt Tolerance, Day Length, Temperature and Pollination in Different Plants

In order to study and compare the phosphate transporter gene codon usage and it's respond to the traits like salt tolerance, day length, Pollination and temperature in different plants, 100 isoform from 10 plants are extracted from NCBI website and then analyzed with Gene Infinity and Minitab 16 software. The result shows that the highest codon usage similarity (81.95%) was for wheat a...

متن کامل

Identification of Synonymous Codon Usage Bias in the Pseudorabies Virus UL31 Gene

Background: Little knowledge of synonymous codon usage pattern of pseudorabies virus (PRV) genome, especially the UL31 gene in the process for its evolution is available. Objectives: In the present study, the codon usage bias between PRV UL31 sequence and the UL31-like sequences was identified. Materials and Methods: We used a comprehensive analysi...

متن کامل

Bioinformatics Comparison of Codon Usage of Genes Encoding Phosphate Transporter in Terms of Salt Tolerance, Day Length, Temperature and Pollination in Different Plants

In order to study and compare the phosphate transporter gene codon usage and it's respond to the traits like salt tolerance, day length, Pollination and temperature in different plants, 100 isoform from 10 plants are extracted from NCBI website and then analyzed with Gene Infinity and Minitab 16 software. The result shows that the highest codon usage similarity (81.95%) was for wheat a...

متن کامل

Comparative bioinformatics analysis of a wild diploid Gossypium with two cultivated allotetraploid species

Background: Gossypium thurberi is a wild diploid species that has been used to improve cultivated allotetraploid cotton. G. thurberi belongs to D genome, which is an important wild bio-source for the cotton breeding and genetic research. To a certain degree, chloroplast DNA sequence information are a versatile tool for species identification and phylogenetic implications in plants. Different ch...

متن کامل

Mutational Pressure Drives Evolution of Synonymous Codon Usage in Genetically Distinct Oenothera plastomes

Background: Most of the amino acids are encoded by more than one codon, termed as synonymous codons. Synonymous codon usage is not random as it is unique to species. In each amino acid family, some synonymous codons are preferred and this is referred to as synonymous codon usage bias (SCUB). Trends associated with evolution of SCUB and factors influencing its diversification in plastomes of gen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bio Systems

دوره 98 1  شماره 

صفحات  -

تاریخ انتشار 2009